Capturing Data Using XML Paragraph-centric Documents

Authors

  • Y. Badr
  • M. Sayah
  • F. Laforest
  • A. Flory
Abstract

Nowadays, graphical forms are used to capture input data and feed traditional databases. They are associated with rigid schemas and constraints that ensure data validity and control the entry sequence. Over the Internet, electronic documents are widely exchanged and their data reused in a large range of tasks. As a direct consequence, a capture interface that relies on documents can serve as a flexible front end and as a natural way to capture data. The aim of our research is to develop a comprehensive data capture and mapping system that ensures flexible, well-adapted information capture and, at the same time, efficient information retrieval. This system introduces a transformation process that maps between the document model and the traditional database model. We validate our transformation framework by implementing a prototype. The first release of the prototype has received substantial attention.
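
As an illustration only, the kind of document-to-database mapping described in the abstract might be sketched as follows. This is not the authors' prototype: the sample document, the tag names, the extraction patterns in `PATTERNS`, and the `captured` table are all hypothetical, chosen simply to show narrative paragraphs being turned into relational rows.

```python
import re
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical paragraph-centric document: tags delimit narrative paragraphs.
DOCUMENT = """
<record patient="P001">
  <examination>Blood pressure was 130/85 mmHg and weight was 72 kg.</examination>
  <prescription>Aspirin 100 mg once daily.</prescription>
</record>
"""

# Hypothetical extraction patterns, keyed by paragraph tag, then by field name.
PATTERNS = {
    "examination": {
        "systolic": r"(\d+)/\d+\s*mmHg",
        "diastolic": r"\d+/(\d+)\s*mmHg",
        "weight_kg": r"(\d+(?:\.\d+)?)\s*kg",
    },
}


def extract_rows(xml_text):
    """Return (patient, paragraph, field, value) tuples found in the paragraphs."""
    root = ET.fromstring(xml_text)
    patient = root.get("patient")
    rows = []
    for paragraph in root:
        text = paragraph.text or ""
        # Paragraphs without registered patterns are skipped.
        for field, pattern in PATTERNS.get(paragraph.tag, {}).items():
            match = re.search(pattern, text)
            if match:
                rows.append((patient, paragraph.tag, field, match.group(1)))
    return rows


def load(rows):
    """Store the extracted rows in an in-memory SQLite table (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE captured (patient TEXT, paragraph TEXT, field TEXT, value TEXT)"
    )
    conn.executemany("INSERT INTO captured VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return conn


if __name__ == "__main__":
    connection = load(extract_rows(DOCUMENT))
    for row in connection.execute("SELECT field, value FROM captured"):
        print(row)
```

In this sketch the user keeps writing free text inside tagged paragraphs, while the extracted values become queryable with ordinary SQL. A real system would need far more robust extraction than a few regular expressions.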


Similar Resources

Study of the Automatic Construction of XML Documents Models from a Relational Data Model

End-user information capture remains a sensitive challenge, especially when information is in the form of documents. The difficulty lies in indexing the information so that it can be queried precisely. In the DRUID project, the end-user captures XML paragraph-centric documents (i.e. documents with tags delimiting narrative text paragraphs), and a transformation tool generates XML data...


Xtractor: A light wrapper for XML paragraph-centric documents

The emergence of XML has led to the development of applications centered on XML documents. These documents often contain tagged paragraphs of natural-language text. Extracting relevant data from such paragraphs is complicated by the irregular structure hidden in the text and requires powerful extraction patterns. Although a large spectrum of wrappers has been conceived, mainly to process HTML pages, the ...


NATIVE XML DATABASES vs. RELATIONAL DATABASES IN DEALING WITH XML DOCUMENTS

When dealing with data-centric XML documents, it is possible to convert XML documents into a relational database, which can then be queried using SQL. Such relational databases are called XML-enabled databases. On the other hand, the best choice for storing, updating and retrieving document-centric XML documents is usually a native XML database (NXD). NXDs store XML documents as logical units, ...


Current Approaches to XML Management

The Extensible Markup Language has become the standard for information interchange on the Web. Developed primarily as a document markup language more powerful than HTML yet less complex than SGML, XML does not require content to adhere to structural rules. XML gives a single, human-readable syntax for representing data, including data in relational format. Hence XML appeals to both the document...


Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As we move to a big data world in which unstructured data grows at high velocity, a data store is needed to hold this large volume of data. Relational databases, which have traditionally been used, are not well suited to handling data at this scale, so it is necessary to move to non-relational data stores. In the current study, we have proposed an extension of the Mongo...




Publication date: 2007